Search for: All records

Creators/Authors contains: "Sobel, Eric M."

« Prev Next »

Total Resources

4

Resource Type
Conference Paper

0

Conference Proceeding

0

Dataset

0

Journal Article

4

Workshop Report

0

Availability
Full Text / Resource Available

4

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Unsupervised discovery of ancestry-informative markers and genetic admixture proportions in biobank-scale datasets

https://doi.org/10.1016/j.ajhg.2022.12.008

Ko, Seyoon ; Chu, Benjamin B. ; Peterson, Daniel ; Okenwa, Chidera ; Papp, Jeanette C. ; Alexander, David H. ; Sobel, Eric M. ; Zhou, Hua ; Lange, Kenneth L. ( February 2023 , The American Journal of Human Genetics)

Full Text Available
A fast data-driven method for genotype imputation, phasing and local ancestry inference: MendelImpute.jl

https://doi.org/10.1093/bioinformatics/btab489

Chu, Benjamin B ; Sobel, Eric M ; Wasiolek, Rory ; Ko, Seyoon ; Sinsheimer, Janet S ; Zhou, Hua ; Lange, Kenneth ( July 2021 , Bioinformatics)
Kelso, Janet (Ed.)
Abstract Motivation Current methods for genotype imputation and phasing exploit the volume of data in haplotype reference panels and rely on hidden Markov models (HMMs). Existing programs all have essentially the same imputation accuracy, are computationally intensive and generally require prephasing the typed markers. Results We introduce a novel data-mining method for genotype imputation and phasing that substitutes highly efficient linear algebra routines for HMM calculations. This strategy, embodied in our Julia program MendelImpute.jl, avoids explicit assumptions about recombination and population structure while delivering similar prediction accuracy, better memory usage and an order of magnitude or better run-times compared to the fastest competing method. MendelImpute operates on both dosage data and unphased genotype data and simultaneously imputes missing genotypes and phase at both the typed and untyped SNPs (single nucleotide polymorphisms). Finally, MendelImpute naturally extends to global and local ancestry estimation and lends itself to new strategies for data compression and hence faster data transport and sharing. Availability and implementation Software, documentation and scripts to reproduce our results are available from https://github.com/OpenMendel/MendelImpute.jl. Supplementary information Supplementary data are available at Bioinformatics online.
more » « less
Full Text Available
Modern simulation utilities for genetic analysis

https://doi.org/10.1186/s12859-021-04086-8

Ji, Sarah S. ; German, Christopher A. ; Lange, Kenneth ; Sinsheimer, Janet S. ; Zhou, Hua ; Zhou, Jin ; Sobel, Eric M. ( May 2021 , BMC Bioinformatics)

Abstract Background
Statistical geneticists employ simulation to estimate the power of proposed studies, test new analysis tools, and evaluate properties of causal models. Although there are existing trait simulators, there is ample room for modernization. For example, most phenotype simulators are limited to Gaussian traits or traits transformable to normality, while ignoring qualitative traits and realistic, non-normal trait distributions. Also, modern computer languages, such as Julia, that accommodate parallelization and cloud-based computing are now mainstream but rarely used in older applications. To meet the challenges of contemporary big studies, it is important for geneticists to adopt new computational tools.
Results
We present , an open-source Julia package that makes it trivial to quickly simulate phenotypes under a variety of genetic architectures. This package is integrated into our OpenMendel suite for easy downstream analyses. Julia was purpose-built for scientific programming and provides tremendous speed and memory efficiency, easy access to multi-CPU and GPU hardware, and to distributed and cloud-based parallelization. is designed to encourage flexible trait simulation, including via the standard devices of applied statistics, generalized linear models (GLMs) and generalized linear mixed models (GLMMs). also accommodates many study designs: unrelateds, sibships, pedigrees, or a mixture of all three. (Of course, for data with pedigrees or cryptic relationships, the simulation process must include the genetic dependencies among the individuals.) We consider an assortment of trait models and study designs to illustrate integrated simulation and analysis pipelines. Step-by-step instructions for these analyses are available in our electronic Jupyter notebooks on Github. These interactive notebooks are ideal for reproducible research.
Conclusion
The package has three main advantages. (1) It leverages the computational efficiency and ease of use of Julia to provide extremely fast, straightforward simulation of even the most complex genetic models, including GLMs and GLMMs. (2) It can be operated entirely within, but is not limited to, the integrated analysis pipeline of OpenMendel. And finally (3), by allowing a wider range of more realistic phenotype models, brings power calculations and diagnostic tools closer to what investigators might see in real-world analyses.

more » « less
Fast Genome‐Wide QTL Association Mapping on Pedigree and Population Data

https://doi.org/10.1002/gepi.21988

Zhou, Hua ; Blangero, John ; Dyer, Thomas D. ; Chan, Kei‐hang K. ; Lange, Kenneth ; Sobel, Eric M. ( December 2016 , Genetic Epidemiology)